Category listing - textproc
((V)irtual = Package is only listed here)
aiksaurus English-language thesaurus
antiword Free MS Word to text and PostScript converter
asciidoc ASCII to formatted document converter
aspell Spell checker with good multi-language support
aspell-breton Breton language support for aspell
aspell-catalan Catalan language support for aspell
aspell-czech Czech language support for aspell
aspell-danish Danish language support for aspell
aspell-dutch Dutch language support for aspell
aspell-english English language support for aspell
aspell-esperanto Esperanto language support for aspell
aspell-faroese Faroese language support for aspell
aspell-francais French language support for aspell
aspell-gaeilge Irish language support for aspell
aspell-german German language support for aspell
aspell-greek Greek language support for aspell
aspell-italian Italian language support for aspell
aspell-norwegian Norwegian language support for aspell
aspell-polish Polish language support for aspell
aspell-portuguese Portuguese language support for aspell
aspell-romanian Romanian language support for aspell
aspell-russian Russian language support for aspell
aspell-slovak Slovak language support for aspell
aspell-spanish Spanish language support for aspell
aspell-svenska Swedish language support for aspell
aspell-ukrainian Ukrainian language support for aspell
aspell-welsh Welsh language support for aspell
astyle (V) Reindenter and reformatter of C++, C and Java source code
awf Text formatter (nroff-clone) written in awk
bibclean Prettyprinter and syntax checker for BibTeX bibliography databases
biblook Indexing and searching tools for BibTeX bibliography databases
bibparse Syntax checking tools for BibTeX bibliography databases
bsdgrep-devel BSD version of grep as in NetBSD src/usr.bin/grep
btparse BibTeX parsing library
c2html Converts a C source tree to hyperlinked and colored HTML
catdoc Converts MS Word, Excel and Powerpoint files to plain text
catdoc-tk Reads MS-Word file and puts out its content as plain text (Tk interface)
cawf Simplistic nroff-like formatter in C, like awf
cdif Word context diff
chasen ChaSen, Japanese Morphological Analysis System
chasen-base ChaSen, Japanese Morphological Analysis System
convertlit Convert Microsoft Reader format eBooks into open format
crimson Apache.org implementation of JAXP, SAX, and DOM
db2latex Set of XSLT stylesheets converting DocBook to LaTeX2e
detex Remove LaTeX commands
dict-client Dictionary Service Protocol client
dict-dictionaries Dictionary data for DICTD
dict-server Dictionary Service Protocol server
dictem Dictionary client (RFC-2229) for [X]Emacs
diction GNU version of diction and style
diffsplit Splits a unified diff into pieces which patch one file each
diffstat Display a histogram of diff changes
docbook SGML DTD designed for computer documentation
docbook-simple Simplified DocBook XML DTD
docbook-website DocBook XML DTD for building websites
docbook-xml XML DTD designed for computer documentation
docbook-xsl DocBook XSL modular stylesheet
doclifter Translates documents written in troff macros to DocBook
dsssl-docbook-modular DSSSL stylesheets for the DocBook DTD
dtdparse Reads an SGML or XML DTD and constructs an XML database
eb C library for accessing EB, EBG, EBXA and EPWING CD-ROM dictionaries
eblook Interactive command-line interface for EPWING electronic dictionaries
emacs-dict-client Emacs package for talking to a dictionary server
enca Extremely Naive Charset Analyser
enchant Generic spell checking library
expat XML parser library written in C
expatobjc Objective-C Wrapper for Expat
ezxml Easy to use C library for parsing XML documents
flyspell Emacs/XEmacs on-the-fly spell checker
fop The Apache Project's XSL Formatting Objects implementation
freepwing Free JIS X 4081 (subset of EPWING V1) formatter
gdome2 Gnome DOM (Document Object Model) engine
glimpse Text search engine
gnome-doc-utils Documentation utilities for the GNOME project
gnome-spell Spell checking as you type like gtkspell
grep GNU grep
groff GNU roff text processing suite
gsed GNU implementation of sed, the POSIX stream editor
gtk-doc Tools for authors of the GTK+ reference documentation
gtkspell Spell checking GtkTextView widget
GutenMark Automatic, high-quality Gutenberg text formatter to LaTeX or HTML
GutenMark-words Word lists for GutenMark
harmony Generic framework for reconciling disconnected updates to heterogeneous, replicated XML data
helpdeco Windows .hlp to .rtf converter
hevea LaTeX to HTML translator
hiawatha (V) Barebones HTTP server with XML and XSLT support (and more)
hre Hangeul Regular Expression Library
html SGML DTDs for the Hypertext Markup Language
html2text Advanced HTML-to-text converter
html2wml On-the-fly HTML to WML conversion
hugs-HaXml Haskell utilities for managing and generating XML documents (Hugs package)
hyperestraier Full-text search system for communities
icu Robust and full-featured Unicode services
intltool Toolbox for internationalisation
ipadic Japanese Morphological Dictionary for ChaSen
isearch Advanced text indexing and searching system
iso-codes List of country, language and currency names
iso12083 SGML DTDs from the Electronic Publishing Special Interest Group
iso8879 Character entity sets from ISO 8879:1986 (SGML)
ispell-base Interactive spelling checker
ispell-british British dictionary for interactive spelling checker
ispell-catalan Catalan dictionary for interactive spelling checker
ispell-emacs Emacs interface for ispell spell checker
ispell-francais French dictionary for interactive spelling checker
ispell-gaeilge Irish language support for ispell
ispell-german German dictionary for interactive spelling checker
ispell-polski Polish dictionary for interactive spelling checker
ispell-romanian Romanian dictionary for ispell
ispell-russian Russian (KOI8-R) ispell dictionary from Alexander Lebedev
ispell-russian-io Russian (KOI8-R) ispell dictionary from Alexander Lebedev
ispell-slovak Slovak dictionary for ispell
ispell-spanish Spanish dictionary for interactive spelling checker
ispell-svenska Swedish dictionary for interactive spelling checker
itex2MML Converts itex equations to MathML
ja-grep GNU grep + multi-byte extension
ja-groff Japanese enhancement of GNU groff
ja-sed GNU sed + multi-byte extension
jade Object-oriented SGML/XML parser toolkit and DSSSL engine
java-mecab MeCab Java module
jing RELAX NG validator in Java
kakasi Kanji-Kana Simple Inverter, language filter for Japanese
kbanner Display kanji files in large letters
kdoc C++ and IDL Class Documentation Tool
latex2html LaTeX to HTML converter
libcroco Toolkit to parse and manipulate CSS (Cascading Style Sheets)
liblrdf Library for easy manipulation of LADSPA plugin RDF descriptions
libpathan Library to parse and evaluate XPath expressions
libunicode Library for manipulating Unicode characters and strings
libxml XML parser (version 1), mainly used by the GNOME project
libxml++ C++ wrapper for the libxml XML parser library
libxml++2 C++ wrapper for the libxml XML parser library
libxml2 XML parser library from the GNOME project
libxslt XSLT parser library from the GNOME project
lout Basser Lout, a TeX/troff-like formatter with PostScript/PDF output
lq-sp Modified SP package
lua-expat XML parser for Lua based on expat
makeztxt ASCII text to Palm zTXT database converter
Markdown Text-to-HTML conversion tool for web writers
mecab Yet Another Part-of-Speech and Morphological Analyzer
mecab-base Yet Another Part-of-Speech and Morphological Analyzer
mecab-ipadic Japanese Morphological Dictionary for MeCab
mecab-jumandic Japanese Morphological Dictionary for MeCab
namazu2 Full-text search system intended for easy use
nbsed NetBSD-current's sed(1)
ndtpd Server for accessing CD-ROM books with NDTP
nxml-mode Major mode for editing XML documents for Emacs
o3read Standalone converter for OpenOffice and OpenDocument file formats
openjade SGML/XML parser toolkit and DSSSL engine, successor to jade
opensp SGML parser, successor to sp
p5-Convert-ASCII-Armour Perl5 module to convert binary octets into ASCII armour
p5-Convert-ASN1 Perl5 module to encode/decode ASN.1 data
p5-Convert-BER Perl class to encode/decode objects using Basic Encoding Rules
p5-Convert-PEM Perl5 module to read/write ASN.1-encoded PEM files
p5-Cz-Cstools Tools for dealing with Czech and Slovak texts in Perl
p5-Data-FormValidator Validates user input based on input profile
p5-Date-Business (V) Perl5 module for fast calendar and business date calculations
p5-Date-Manip (V) Perl5 module for date calculations
p5-Encode Provides interfaces between strings and the rest of the system
p5-Encode-Detect Perl module that detects the encoding of data
p5-Feed-Find Perl module to perform autodiscovery of syndication feeds
p5-Filter Perl5 classes representing a number of source filters
p5-libxml Perl module collection for working with XML
p5-Lingua-EN-Inflect Perl module for inflection of English words and a/an selection
p5-Lingua-EN-Numbers-Ordinate Go from cardinal number (3) to ordinal (3rd)
p5-Lingua-EN-Sentence Perl module for splitting English text into sentences
p5-Lingua-Preferred Choose a preferred language from a selection
p5-Lingua-Stem-Snowball Lingua::Stem::Snowball - Perl interface to Snowball stemmers
p5-mecab MeCab Perl module
p5-native-hyperestraier Perl interface of Hyper Estraier
p5-Net-Dict Client API for the DICT protocol defined in RFC2229
p5-Number-Format Perl extension for formatting numbers
p5-PDF Perl5 module for PDF document manipulation
p5-PDF-API2 Perl5 module for next generation API for PDF
p5-PDF-Create Perl5 module for creating PDF documents
p5-Pod-Coverage Checks if the documentation of a module is comprehensive
p5-Pod-Escapes Perl module for decoding Pod E<...> sequences
p5-Pod-POM P5 module to format POD into an object format, hence POM
p5-Pod-Simple Simple framework for parsing Pod
p5-Pod-Tests Perl5 module that extracts embedded tests and code examples from POD
p5-Pod-Tree Create a static syntax tree for a POD
p5-Regexp-Common Provide commonly requested regular expressions
p5-SGMLS Class for postprocessing the output from the sgmls and nsgmls parsers
p5-String-Approx Approximate (fuzzy) string matching library for Perl
p5-String-CRC32 Perl module to generate cksums from strings and from files
p5-String-ShellQuote Quote strings for passing through the shell
p5-Template-Plugin-Number-Format Plugin/filter interface to Number::Format
p5-Text-Autoformat Perl module for text wrapping and reformatting
p5-Text-Balanced Extract delimited text sequences from strings
p5-Text-BibTeX Perl library for reading, parsing, and processing BibTeX files
p5-Text-CharWidth Perl5 wrappers around wcwidth(3) and family
p5-Text-ChaSen Perl5 module to use ChaSen
p5-Text-Context-EitherSide Get n words either side of search keywords
p5-Text-CSV_XS Routines for composition and decomposition of comma-separated values
p5-Text-CSV-Hash Perl5 module for hash based CSV usage
p5-Text-DelimMatch Find regexp delimited strings with proper nesting
p5-Text-Diff High-level text diffing module for Perl
p5-Text-Diff-HTML HTML formatting class for Text::Diff
p5-Text-Emoticon Emoticon filter class
p5-Text-Emoticon-MSN Emoticon filter of MSN Messenger
p5-Text-Format Provide perl5 formatting functions on plain text
p5-Text-Glob Match globbing patterns against text
p5-Text-Kakasi Perl5 module to use Kakasi
p5-Text-LevenshteinXS XS implementation of the Levenshtein edit distance
p5-Text-Quoted Extract the structure of a quoted mail message
p5-Text-Reflow Reflowing of text using Knuth's paragraphing algorithm
p5-Text-Reform Manual text wrapping and reformatting
p5-Text-RewriteRules Perl 5 module to rewrite text using regexp-based rules
p5-Text-Shellwords Wrapper around shellwords.pl package
p5-Text-Substitute Perl5 module for text substitution from hashes
p5-Text-Tabs+Wrap Line wrapping to form simple paragraphs
p5-Text-Template Perl5 library for generating form letters
p5-Text-Textile Perl implementation of the Textile formatting language
p5-Text-Unaccent Perl5 module that removes accents from a string
p5-Text-vCard Parse, edit and create vCards (RFC 2426)
p5-Text-vFile-asData Parse vFile formatted files into data structures
p5-Text-WikiFormat Translate Wiki formatted text into other formats
p5-Text-WrapI18N Perl5 module to wrap internationalized text
p5-Text-Wrapper Perl5 module that provides simple word wrapping
p5-Tie-IxHash (V) Perl module that implements ordered in-memory associative arrays
p5-TimeDate (V) Perl5 TimeDate distribution
p5-XML-Atom Atom feed and API implementation
p5-XML-Atom-SimpleFeed Generate simple Atom syndication feeds
p5-XML-Atom-Stream Client interface for AtomStream
p5-XML-AutoWriter DOCTYPE-driven valid XML output
p5-XML-Checker Perl module for validating XML
p5-XML-Clean Ensure that (HTML) text passes through an XML parser
p5-XML-DOM Extend XML::Parser to build DOM Level 1 compliant data structure
p5-XML-Dumper Perl to XML structure input/output engine
p5-XML-Encoding Perl module for parsing XML encoding maps
p5-XML-Feed Perl syndication feed parser for both RSS and Atom feeds
p5-XML-Filter-BufferText Perl5 module XML parser filter to put all characters() in one event
p5-XML-Filter-DetectWS PerlSAX filter that detects ignorable whitespace
p5-XML-Filter-DOMFilter-LibXML Perl5 module SAX filter allowing DOM processing
p5-XML-Filter-Reindent Reformats whitespace for pretty printing XML
p5-XML-Filter-SAXT Replicates SAX events to several SAX event handlers
p5-XML-Grove Perl 5 module providing simple objects for parsed XML documents
p5-XML-Handler-Trees PerlSAX handlers for building tree structures
p5-XML-Handler-YAWriter Another Perl module for writing XML documents
p5-XML-LibXML Perl interface to the libxml2 library
p5-XML-LibXML-Common Routines and constants common for XML::LibXML and XML::GDOME
p5-XML-LibXML-Iterator Iterator for XML::LibXML parsed documents
p5-XML-LibXSLT Perl interface to the libxslt library
p5-XML-NamespaceSupport Perl module to the SAX2 NamespaceSupport class
p5-XML-Node Node-based XML parsing: a simplified interface to XML::Parser
p5-XML-NodeFilter Object that knows how to filter out nodes
p5-XML-Parser Perl extension interface to James Clark's XML parser, expat
p5-XML-RAI Maps RSS tags to one common simplified interface
p5-XML-RegExp Provide regular expressions for some XML tokens
p5-XML-RSS XML-RSS helps to create and update RSS files
p5-XML-RSS-Parser Liberal object-oriented parser for RSS feeds
p5-XML-Sablotron Perl interface to the Sablotron XSLT processor
p5-XML-SAX Perl interface to the SAX2 XML Parser
p5-XML-SAX-Expat Perl SAX2 XML driver sitting on top of Expat (XML::Parser)
p5-XML-SAX-Writer SAX2 (XML) Writer
p5-XML-SemanticDiff Perl extension for comparing XML documents
p5-XML-Simple Easy Perl API to read/write XML
p5-XML-Stream XML::Stream provides you with access to XML Stream
p5-XML-Twig Efficient XML document interface
p5-XML-UM Convert UTF-8 strings to any encoding supported by XML::Encoding
p5-XML-Writer Perl module for writing XML documents
p5-XML-Writer-String Perl module for writing XML documents based on XML::Writer
p5-XML-Xerces Validating XML parser API for Perl
p5-XML-XPath XML XPath software
p5-XML-XQL Perl module to perform XQL queries on XML object trees
p5-XML-XSLT Perl5 module for processing XSLT
p5-XML-XUpdate-LibXML Simple implementation of XUpdate format based on XML::LibXML
p5-YAML YAML implementation for Perl
p5-YAML-Syck Fast, lightweight YAML loader and dumper
par Paragraph reformatter, vaguely similar to fmt, but better
php-json PHP extension for JSON serialization support
php-pspell PHP extension for pspell support
php-wddx PHP extension for WDDX support
php4-domxml PHP4 extension for DOM support
php4-xslt PHP4 extension for XSLT functions (Sablotron backend)
php5-dom PHP5 extension for DOM support
php5-xsl PHP5 extension for XSLT functions
po4a Tool for using gettext where it was not intended to be used
postgresql-autodoc Generate HTML, DOT, and XML description of database structure
psgml-mode SGML/XML mode for Emacs
pxp Polymorphic XML parser, a validating XML-1.0 parser (OCaml)
py-cmTemplate Simple and fast Python template engine
py-csv CSV reading module for Python
py-docutils Python tool to generate documents
py-elementtree Read XML and HTML files into trees of Element objects
py-expat Python interface to expat
py-feedparser Parse RSS and Atom feeds in Python
py-FourSuite XML processing tools
py-gdick English-Korean Dictionary Client for GNOME2
py-gnosis-utils Classes for working with XML
py-HappyDoc Python tool to generate Python API documents
py-html2text Convert HTML into easy-to-read plain ASCII text
py-libxml2 Python wrapper for libxml2
py-libxslt Python wrapper for libxslt
py-markdown XHTML generator using a simple markup
py-mecab MeCab Python module
py-Reverend General purpose Bayesian classifier
py-SimpleParse Simple parser generator for mxTextTools text-tagging engine
py-textile XHTML generator using a simple markup
py-X Package for the creation of PostScript and PDF files
py-xml Collection of libraries to process XML with Python
py-yaml Collection of libraries to process YAML with Python
qprint Encode and decode quoted-printable files
qsubst Query-replace strings in files
raptor RDF Parser Toolkit written in C
regexx C++ regular expression library
rfcutil Search for RFCs and do ports, services & protocol lookups
rman Produces HTML from formatted and unformatted man pages
robodoc Tool to support project documentation within source code
rtf-tools RTF to troff/groff/text converter
rtfm NetBSD documentation and GNU Texinfo files search mechanism
rubber Automated system for building LaTeX documents
ruby-amrita HTML/XHTML template library for Ruby
ruby-eruby Interprets Ruby code embedded in a text file
ruby-feed-normalizer Extensible Ruby wrapper for Atom and RSS parsers
ruby-ferret Text search engine library written for Ruby
ruby-hpricot Fast, enjoyable HTML parser for Ruby
ruby-html-parser HTML-parser package for Ruby
ruby-htree Tree data structure which represents HTML and XML data for Ruby
ruby-itex2MML Ruby binding for itex2MML
ruby-maruku Markdown-superset interpreter
ruby-mecab MeCab Ruby module
ruby-native-hyperestraier Ruby native interface of Hyper Estraier
ruby-nqxml XML parser written in pure Ruby
ruby-pure-hyperestraier Ruby pure interface of Hyper Estraier
ruby-rdtool RD (Ruby Document) converter to HTML/man/etc
ruby-redcloth Textile library for Ruby
ruby-rttool RT to HTML (and hopefully LaTeX in future) table converter
ruby-simple-rss Simple, flexible, extensible, and liberal RSS and Atom reader
ruby-syntax Ruby lexical analysis framework
ruby-webunit (V) HTTP unit testing framework for Ruby
ruby-xmlparser Expat interface module for Ruby
ruby-xmlscan Pure Ruby XML parser
sablotron XML toolkit implementing XSLT, DOM, and XPath
saxon Michael H. Kay's Java XSLT processor
scew Light-weight DOM-like object model API for Expat
scrollkeeper Open Document Cataloging Project
siag (V) Poor man's office suite with spreadsheet, word processor, etc
source-highlight Highlight syntax of various languages source into HTML document
stardic English-Chinese dictionary
stow (V) Maps several separate packages into a tree without merging them
subtitleripper Subtitle ripping program
tcl-dom DOM implementation for use with TclXML or TclExpat
tcl-expat XML parser implemented entirely in Tcl
tcl-xml XML parser implemented entirely in Tcl
tei DTD of the Text Encoding Initiative
teixsl-fo XSLT Stylesheets to convert TEI to XSL Formatting Objects
teixsl-html XSLT Stylesheets to convert TEI to HTML
tex-xmltex Non-validating XML parser implemented in TeX
tex2page Converts TeX manuscripts into (HTML) web pages
texi2html Texinfo-to-HTML direct translator
texi2roff Texinfo-to-ROFF direct translator
trang Multi-format schema converter based on RELAX NG
troffcvt Troff/groff to RTF/HTML/TEXT converter
unroff Programmable troff translator with backend for HTML
untex Remove LaTeX commands
urlview Extract URLs from text files and display them in a menu
vis Convert strings from/to a visual format
wdiff Word-by-word diff
writer2latex Convert OpenOffice.org/StarOffice documents to LaTeX and other formats
xalan-c XSLT processor of the Apache Project
xalan-j The Apache Project's XSLT implementation
xdvipresent (V) Slide Presentations Using LaTeX/xdvi
xerces-c Validating C++ XML parser with DOM and SAX support
xerces-j The Apache Project's validating XML parser with DOM and SAX support
xfce4-dict-plugin Xfce dictionary server plugin
xhtml DTDs for the Extensible Hypertext Markup Language
xhtmldiff Tool for generating valid XHTML redline documents
xml2doc XML to document formats converter
xmlcatmgr XML and SGML catalog manager
xmlindent XML stream reformatter written in ANSI C
xmlrpc-c Library for writing an XML-RPC server or client in C or C++
xmlstarlet Command line utilities for XML manipulation
xmlto Tool to help transform XML documents into other formats
xp James Clark's non-validating XML Parser for Java
xslide XSL major mode for Emacs
xt James Clark's Java implementation of XSLT
yodl High-level document preparation system